Overview

Dataset info

Number of variables 21
Number of observations 21597
Missing cells 6281 (1.4%)
Duplicate rows 0 (0.0%)
Total size in memory 3.5 MiB
Average record size in memory 168.0 B

Variable types

Numeric 18
Categorical 2
Boolean 1
Date 0
URL 0
Text (Unique) 0
Rejected 0
Unsupported 0

Warnings

date has a high cardinality: 372 distinct values [Warning]
sqft_basement has a high cardinality: 304 distinct values [Warning]
view has 19422 (89.9%) zeros [Zeros]
waterfront has 2376 (11.0%) missing values [Missing]
yr_renovated has 17011 (78.8%) zeros [Zeros]
yr_renovated has 3842 (17.8%) missing values [Missing]
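
The missing-value figures in the overview can be cross-checked against the per-variable counts in the variable sections. The following is a minimal sketch in plain Python that uses only numbers printed in this report; the variable names are illustrative and nothing here reads the underlying dataset.

```python
# Figures copied from the report; the dataset itself is not loaded.
n_variables = 21
n_observations = 21597

# Per-variable missing counts as listed in the variable sections.
missing_by_variable = {"waterfront": 2376, "yr_renovated": 3842, "view": 63}

total_cells = n_variables * n_observations
missing_cells = sum(missing_by_variable.values())
missing_pct = 100 * missing_cells / total_cells

print("missing cells:", missing_cells)
print("share of all cells (%):", round(missing_pct, 1))

# Share of rows missing within each affected variable.
per_variable_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in missing_by_variable.items()
}
print(per_variable_pct)
```

The three per-variable counts add up to the overview's missing-cell total, so no other variable contributes missing cells.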

Variables

bathrooms
Numeric

Distinct count 29
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2.115826272
Minimum 0.5
Maximum 8
Zeros (%) 0.0%

Quantile statistics

Minimum 0.5
5-th percentile 1
Q1 1.75
Median 2.25
Q3 2.5
95-th percentile 3.5
Maximum 8
Range 7.5
Interquartile range 0.75

Descriptive statistics

Standard deviation 0.7689842967
Coef of variation 0.3634439683
Kurtosis 1.279315294
Mean 2.115826272
MAD 0.6145972614
Skewness 0.5197092816
Sum 45695.5
Variance 0.5913368485
Memory size 168.9 KiB

Histogram with fixed size bins (bins=29)
Histogram with variable size bins (bins=[0.5 0.625 0.875 1.125 1.375 ... 4.125 4.625 5.375 6.125 8. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
2.5 5377 24.9%
 
1 3851 17.8%
 
1.75 3048 14.1%
 
2.25 2047 9.5%
 
2 1930 8.9%
 
1.5 1445 6.7%
 
2.75 1185 5.5%
 
3 753 3.5%
 
3.5 731 3.4%
 
3.25 589 2.7%
 
Other values (19) 641 3.0%
 

Minimum 5 values

Value Count Frequency (%)
0.5 4 < 0.1%
 
0.75 71 0.3%
 
1 3851 17.8%
 
1.25 9 < 0.1%
 
1.5 1445 6.7%
 

Maximum 5 values

Value Count Frequency (%)
8 2 < 0.1%
 
7.75 1 < 0.1%
 
7.5 1 < 0.1%
 
6.75 2 < 0.1%
 
6.5 2 < 0.1%
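
Several of the descriptive statistics are simple functions of the others, which gives a quick way to sanity-check a block like the one for bathrooms. The sketch below recomputes the derived values from the reported ones; the relations used (coefficient of variation = standard deviation / mean, variance = standard deviation squared, sum = mean x number of observations) are the standard definitions and are assumed, not confirmed, to be what the profiling tool uses.

```python
import math

# Values copied from the bathrooms section of the report.
n_observations = 21597
mean = 2.115826272
std = 0.7689842967
q1, q3 = 1.75, 2.5
minimum, maximum = 0.5, 8

coef_of_variation = std / mean      # reported as 0.3634439683
variance = std ** 2                 # reported as 0.5913368485
total = mean * n_observations       # reported Sum is 45695.5
value_range = maximum - minimum     # reported as 7.5
iqr = q3 - q1                       # reported as 0.75

# Compare with the reported figures, allowing for rounding in the inputs.
checks = {
    "coef_of_variation": math.isclose(coef_of_variation, 0.3634439683, rel_tol=1e-6),
    "variance": math.isclose(variance, 0.5913368485, rel_tol=1e-6),
    "sum": math.isclose(total, 45695.5, rel_tol=1e-6),
    "range": value_range == 7.5,
    "iqr": iqr == 0.75,
}
print(checks)
```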
 

bedrooms
Numeric

Distinct count 12
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.373199981
Minimum 1
Maximum 33
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 2
Q1 3
Median 3
Q3 4
95-th percentile 5
Maximum 33
Range 32
Interquartile range 1

Descriptive statistics

Standard deviation 0.9262988945
Coef of variation 0.2746053894
Kurtosis 49.82183475
Mean 3.373199981
MAD 0.7335737152
Skewness 2.023641235
Sum 72851
Variance 0.858029642
Memory size 168.9 KiB

Histogram with fixed size bins (bins=12)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 4.5 5.5 6.5 7.5 10.5 33. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
3 9824 45.5%
 
4 6882 31.9%
 
2 2760 12.8%
 
5 1601 7.4%
 
6 272 1.3%
 
1 196 0.9%
 
7 38 0.2%
 
8 13 0.1%
 
9 6 < 0.1%
 
10 3 < 0.1%
 
Other values (2) 2 < 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 196 0.9%
 
2 2760 12.8%
 
3 9824 45.5%
 
4 6882 31.9%
 
5 1601 7.4%
 

Maximum 5 values

Value Count Frequency (%)
33 1 < 0.1%
 
11 1 < 0.1%
 
10 3 < 0.1%
 
9 6 < 0.1%
 
8 13 0.1%
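
The maximum of 33 bedrooms sits far from the rest of the distribution (kurtosis 49.8). One common way to quantify this is Tukey's 1.5 x IQR rule together with a z-score; the report itself applies neither, so the sketch below is only an illustration built from the figures printed above.

```python
# Quartiles, mean and standard deviation copied from the bedrooms section.
q1, q3 = 3, 4
mean = 3.373199981
std = 0.9262988945

# Tukey's rule (an assumption here, not something the report uses):
# values more than 1.5 * IQR beyond the quartiles are flagged.
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Bedroom count -> number of rows, rebuilt from the frequency tables above.
bedroom_counts = {
    1: 196, 2: 2760, 3: 9824, 4: 6882, 5: 1601, 6: 272,
    7: 38, 8: 13, 9: 6, 10: 3, 11: 1, 33: 1,
}

flagged_low = sum(n for value, n in bedroom_counts.items() if value < lower_fence)
flagged_high = sum(n for value, n in bedroom_counts.items() if value > upper_fence)

# Distance of the largest value from the mean, in standard deviations.
z_score_max = (max(bedroom_counts) - mean) / std

print(lower_fence, upper_fence, flagged_low, flagged_high, round(z_score_max, 1))
```

Because the interquartile range is only 1, the fences are narrow and also flag ordinary 1-bedroom and 6-bedroom homes, so the rule is better read as a prompt to inspect the 33-bedroom row than as a filter to apply blindly.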
 

condition
Numeric

Distinct count 5
Unique (%) < 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 3.409825439
Minimum 1
Maximum 5
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 3
Q1 3
Median 3
Q3 4
95-th percentile 5
Maximum 5
Range 4
Interquartile range 1

Descriptive statistics

Standard deviation 0.6505456357
Coef of variation 0.1907856127
Kurtosis 0.5192374924
Mean 3.409825439
MAD 0.5607545412
Skewness 1.036037425
Sum 73642
Variance 0.4232096241
Memory size 168.9 KiB

Histogram with fixed size bins (bins=5)

Value Count Frequency (%)
3 14020 64.9%
 
4 5677 26.3%
 
5 1701 7.9%
 
2 170 0.8%
 
1 29 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 29 0.1%
 
2 170 0.8%
 
3 14020 64.9%
 
4 5677 26.3%
 
5 1701 7.9%
 

Maximum 5 values

Value Count Frequency (%)
5 1701 7.9%
 
4 5677 26.3%
 
3 14020 64.9%
 
2 170 0.8%
 
1 29 0.1%
 

date
Categorical

Distinct count 372
Unique (%) 1.7%
Missing (%) 0.0%
Missing (n) 0

6/23/2014 142
6/25/2014 131
6/26/2014 131
Other values (369) 21193

Value Count Frequency (%)
6/23/2014 142 0.7%
 
6/25/2014 131 0.6%
 
6/26/2014 131 0.6%
 
7/8/2014 127 0.6%
 
4/27/2015 126 0.6%
 
3/25/2015 123 0.6%
 
7/9/2014 121 0.6%
 
4/28/2015 121 0.6%
 
4/22/2015 121 0.6%
 
4/14/2015 121 0.6%
 
Other values (362) 20333 94.1%
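
The date column is profiled as Categorical because its values are stored as strings such as 6/23/2014, which is also why it triggers the high-cardinality warning. A minimal sketch of parsing these strings with the standard library follows; the month/day/year order is an assumption inferred from values like 6/23/2014, and the dictionary holds only the ten most frequent dates from the table above.

```python
from collections import Counter
from datetime import datetime

# The ten most frequent date strings and their counts, copied from the table above.
top_dates = {
    "6/23/2014": 142, "6/25/2014": 131, "6/26/2014": 131, "7/8/2014": 127,
    "4/27/2015": 126, "3/25/2015": 123, "7/9/2014": 121, "4/28/2015": 121,
    "4/22/2015": 121, "4/14/2015": 121,
}

# Assumption: month/day/year order, since 6/23/2014 cannot be day/month.
DATE_FORMAT = "%m/%d/%Y"

by_month = Counter()
parsed_dates = []
for text, count in top_dates.items():
    day = datetime.strptime(text, DATE_FORMAT).date()
    parsed_dates.append(day)
    by_month[(day.year, day.month)] += count

print("earliest:", min(parsed_dates), "latest:", max(parsed_dates))
for (year, month), count in sorted(by_month.items()):
    print(year, month, count)
```

Once parsed, the column can be treated as a date and aggregated by month or weekday instead of as 372 unrelated categories.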
 
Max length 10
Mean length 8.924433949
Min length 8
Contains chars False
Contains digits True
Contains spaces False
Contains non-words True

floors
Numeric

Distinct count 6
Unique (%) < 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1.494096402
Minimum 1
Maximum 3.5
Zeros (%) 0.0%

Quantile statistics

Minimum 1
5-th percentile 1
Q1 1
Median 1.5
Q3 2
95-th percentile 2
Maximum 3.5
Range 2.5
Interquartile range 1

Descriptive statistics

Standard deviation 0.539682791
Coef of variation 0.3612101536
Kurtosis -0.4910657592
Mean 1.494096402
MAD 0.4883540215
Skewness 0.6144969756
Sum 32268
Variance 0.2912575149
Memory size 168.9 KiB

Histogram with fixed size bins (bins=6)

Value Count Frequency (%)
1 10673 49.4%
 
2 8235 38.1%
 
1.5 1910 8.8%
 
3 611 2.8%
 
2.5 161 0.7%
 
3.5 7 < 0.1%
 

Minimum 5 values

Value Count Frequency (%)
1 10673 49.4%
 
1.5 1910 8.8%
 
2 8235 38.1%
 
2.5 161 0.7%
 
3 611 2.8%
 

Maximum 5 values

Value Count Frequency (%)
3.5 7 < 0.1%
 
3 611 2.8%
 
2.5 161 0.7%
 
2 8235 38.1%
 
1.5 1910 8.8%
 

grade
Numeric

Distinct count 11
Unique (%) 0.1%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 7.657915451
Minimum 3
Maximum 13
Zeros (%) 0.0%

Quantile statistics

Minimum 3
5-th percentile 6
Q1 7
Median 7
Q3 8
95-th percentile 10
Maximum 13
Range 10
Interquartile range 1

Descriptive statistics

Standard deviation 1.173199664
Coef of variation 0.1532009163
Kurtosis 1.135148022
Mean 7.657915451
MAD 0.9287958624
Skewness 0.7882366364
Sum 165388
Variance 1.376397451
Memory size 168.9 KiB

Histogram with fixed size bins (bins=11)

Value Count Frequency (%)
7 8974 41.6%
 
8 6065 28.1%
 
9 2615 12.1%
 
6 2038 9.4%
 
10 1134 5.3%
 
11 399 1.8%
 
5 242 1.1%
 
12 89 0.4%
 
4 27 0.1%
 
13 13 0.1%
 

Minimum 5 values

Value Count Frequency (%)
3 1 < 0.1%
 
4 27 0.1%
 
5 242 1.1%
 
6 2038 9.4%
 
7 8974 41.6%
 

Maximum 5 values

Value Count Frequency (%)
13 13 0.1%
 
12 89 0.4%
 
11 399 1.8%
 
10 1134 5.3%
 
9 2615 12.1%
 

id
Numeric

Distinct count 21420
Unique (%) 99.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 4580474288
Minimum 1000102
Maximum 9900000190
Zeros (%) 0.0%

Quantile statistics

Minimum 1000102
5-th percentile 512740390
Q1 2123049175
Median 3904930410
Q3 7308900490
95-th percentile 9297300412
Maximum 9900000190
Range 9899000088
Interquartile range 5185851315

Descriptive statistics

Standard deviation 2876735716
Coef of variation 0.6280431971
Kurtosis -1.260749894
Mean 4580474288
MAD 2543785476
Skewness 0.243225522
Sum 9.892450319e+13
Variance 8.275608378e+18
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[1.00010200e+06 7.60006100e+06 7.60013050e+06 1.15005650e+07 1.15205050e+07 ... 9.83930020e+09 9.83930111e+09 9.84230007e+09 9.84230051e+09 9.90000019e+09], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
795000620 3 < 0.1%
 
1825069031 2 < 0.1%
 
2019200220 2 < 0.1%
 
7129304540 2 < 0.1%
 
1781500435 2 < 0.1%
 
3969300030 2 < 0.1%
 
2560801222 2 < 0.1%
 
3883800011 2 < 0.1%
 
2228900270 2 < 0.1%
 
251300110 2 < 0.1%
 
Other values (21410) 21576 99.9%
 

Minimum 5 values

Value Count Frequency (%)
1000102 2 < 0.1%
 
1200019 1 < 0.1%
 
1200021 1 < 0.1%
 
2800031 1 < 0.1%
 
3600057 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
9900000190 1 < 0.1%
 
9895000040 1 < 0.1%
 
9842300540 1 < 0.1%
 
9842300485 1 < 0.1%
 
9842300095 1 < 0.1%
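
The id column has 21420 distinct values across 21597 rows, yet the overview reports 0 duplicate rows, so rows that share an id must differ in at least one other column (presumably repeat sales of the same property, although the report does not say so). A small sketch of how repeated ids can be counted; the list is a hand-built sample reproducing three entries from the frequency tables above, not the full column.

```python
from collections import Counter

# Counts copied from the report.
n_observations = 21597
n_distinct_ids = 21420

# Rows beyond the first occurrence of each id.
extra_rows = n_observations - n_distinct_ids

# Hand-built sample: 795000620 occurs 3 times, 1825069031 twice and
# 1200019 once in the frequency tables above.
sample_ids = [795000620, 795000620, 795000620, 1825069031, 1825069031, 1200019]

occurrences = Counter(sample_ids)
repeated = {value: count for value, count in occurrences.items() if count > 1}

print("extra rows:", extra_rows)
print("repeated ids in sample:", repeated)
```

Any analysis that treats id as a unique key would need to decide how to handle these repeated entries first.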
 

lat
Numeric

Distinct count 5033
Unique (%) 23.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 47.56009299
Minimum 47.1559
Maximum 47.7776
Zeros (%) 0.0%

Quantile statistics

Minimum 47.1559
5-th percentile 47.3103
Q1 47.4711
Median 47.5718
Q3 47.678
95-th percentile 47.7497
Maximum 47.7776
Range 0.6217
Interquartile range 0.2069

Descriptive statistics

Standard deviation 0.1385517682
Coef of variation 0.002913193803
Kurtosis -0.6757902106
Mean 47.56009299
MAD 0.1148176747
Skewness -0.48552159
Sum 1027155.328
Variance 0.01919659246
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[47.1559 47.18955 47.19365 47.19585 47.2141 ... 47.70015 47.73735 47.74675 47.75945 47.7776 ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
47.6624 17 0.1%
 
47.5491 17 0.1%
 
47.5322 17 0.1%
 
47.6846 17 0.1%
 
47.6711 16 0.1%
 
47.6886 16 0.1%
 
47.6955 16 0.1%
 
47.6647 15 0.1%
 
47.6904 15 0.1%
 
47.686 15 0.1%
 
Other values (5023) 21436 99.3%
 

Minimum 5 values

Value Count Frequency (%)
47.1559 1 < 0.1%
 
47.1593 1 < 0.1%
 
47.1622 1 < 0.1%
 
47.1647 1 < 0.1%
 
47.1764 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
47.7776 3 < 0.1%
 
47.7775 3 < 0.1%
 
47.7774 1 < 0.1%
 
47.7772 3 < 0.1%
 
47.7771 2 < 0.1%
 

long
Numeric

Distinct count 751
Unique (%) 3.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean -122.2139825
Minimum -122.519
Maximum -121.315
Zeros (%) 0.0%

Quantile statistics

Minimum -122.519
5-th percentile -122.387
Q1 -122.328
Median -122.231
Q3 -122.125
95-th percentile -121.9798
Maximum -121.315
Range 1.204
Interquartile range 0.203

Descriptive statistics

Standard deviation 0.1407235288
Coef of variation -0.001151451953
Kurtosis 1.052120317
Mean -122.2139825
MAD 0.1150902288
Skewness 0.8848883395
Sum -2639455.38
Variance 0.01980311157
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[-122.519 -122.466 -122.442 -122.4155 -122.4125 ... -121.7685 -121.7435 -121.6945 -121.411 -121.315 ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
-122.29 115 0.5%
 
-122.3 111 0.5%
 
-122.362 104 0.5%
 
-122.291 100 0.5%
 
-122.372 99 0.5%
 
-122.363 99 0.5%
 
-122.288 98 0.5%
 
-122.357 96 0.4%
 
-122.284 95 0.4%
 
-122.172 94 0.4%
 
Other values (741) 20586 95.3%
 

Minimum 5 values

Value Count Frequency (%)
-122.519 1 < 0.1%
 
-122.515 1 < 0.1%
 
-122.514 1 < 0.1%
 
-122.512 1 < 0.1%
 
-122.511 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
-121.315 2 < 0.1%
 
-121.316 1 < 0.1%
 
-121.319 1 < 0.1%
 
-121.321 1 < 0.1%
 
-121.325 1 < 0.1%
 

price
Numeric

Distinct count 3622
Unique (%) 16.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 540296.5735
Minimum 78000
Maximum 7700000
Zeros (%) 0.0%

Quantile statistics

Minimum 78000
5-th percentile 210000
Q1 322000
Median 450000
Q3 645000
95-th percentile 1160000
Maximum 7700000
Range 7622000
Interquartile range 323000

Descriptive statistics

Standard deviation 367368.1401
Coef of variation 0.6799379417
Kurtosis 34.54135858
Mean 540296.5735
MAD 234028.1757
Skewness 4.023364652
Sum 1.16687851e+10
Variance 1.349593504e+11
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 78000. 109750. 149950. 150275. 159997.5 ... 2005000. 2590000. 3410000. 3825000. 7700000. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
350000 172 0.8%
 
450000 172 0.8%
 
550000 159 0.7%
 
500000 152 0.7%
 
425000 150 0.7%
 
325000 148 0.7%
 
400000 145 0.7%
 
375000 138 0.6%
 
300000 133 0.6%
 
525000 131 0.6%
 
Other values (3612) 20097 93.1%
 

Minimum 5 values

Value Count Frequency (%)
78000 1 < 0.1%
 
80000 1 < 0.1%
 
81000 1 < 0.1%
 
82000 1 < 0.1%
 
82500 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
7700000 1 < 0.1%
 
7060000 1 < 0.1%
 
6890000 1 < 0.1%
 
5570000 1 < 0.1%
 
5350000 1 < 0.1%
 

sqft_above
Numeric

Distinct count 942
Unique (%) 4.4%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1788.596842
Minimum 370
Maximum 9410
Zeros (%) 0.0%

Quantile statistics

Minimum 370
5-th percentile 850
Q1 1190
Median 1560
Q3 2210
95-th percentile 3400
Maximum 9410
Range 9040
Interquartile range 1020

Descriptive statistics

Standard deviation 827.7597612
Coef of variation 0.4627984024
Kurtosis 3.405519761
Mean 1788.596842
MAD 640.1909905
Skewness 1.447434235
Sum 38628326
Variance 685186.2222
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 370. 545. 665. 695. 762.5 ... 4505. 4775. 5485. 6690. 9410. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1300 212 1.0%
 
1010 210 1.0%
 
1200 206 1.0%
 
1220 192 0.9%
 
1140 184 0.9%
 
1400 180 0.8%
 
1060 178 0.8%
 
1180 177 0.8%
 
1340 176 0.8%
 
1250 174 0.8%
 
Other values (932) 19708 91.3%
 

Minimum 5 values

Value Count Frequency (%)
370 1 < 0.1%
 
380 1 < 0.1%
 
390 1 < 0.1%
 
410 1 < 0.1%
 
420 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
9410 1 < 0.1%
 
8860 1 < 0.1%
 
8570 1 < 0.1%
 
8020 1 < 0.1%
 
7880 1 < 0.1%
 

sqft_basement
Categorical

Distinct count 304
Unique (%) 1.4%
Missing (%) 0.0%
Missing (n) 0

0.0 12826
? 454
600.0 217
Other values (301) 8100

Value Count Frequency (%)
0.0 12826 59.4%
 
? 454 2.1%
 
600.0 217 1.0%
 
500.0 209 1.0%
 
700.0 208 1.0%
 
800.0 201 0.9%
 
400.0 184 0.9%
 
1000.0 148 0.7%
 
300.0 142 0.7%
 
900.0 142 0.7%
 
Other values (294) 6866 31.8%
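
sqft_basement is profiled as Categorical rather than Numeric because 454 rows hold the placeholder "?" instead of a number, which forces the whole column to be read as strings. A minimal sketch of converting such values, with "?" mapped to a missing value; the helper name parse_basement is hypothetical, and the input list is the sqft_basement column of the ten rows shown under "First rows" in the Sample section.

```python
from typing import Optional


def parse_basement(raw: str) -> Optional[float]:
    """Convert a raw sqft_basement string to float, mapping '?' to None."""
    text = raw.strip()
    if text == "?":
        return None
    return float(text)


# sqft_basement values of the ten "First rows" in the Sample section.
raw_values = ["0.0", "400.0", "0.0", "910.0", "0.0", "1530.0", "?", "0.0", "730.0", "0.0"]

parsed = [parse_basement(value) for value in raw_values]
known = [value for value in parsed if value is not None]

print(parsed)
print("unknown entries:", parsed.count(None))
print("mean of known entries:", sum(known) / len(known))
```

After this conversion the column can be profiled as numeric, and the 454 placeholders show up as missing values instead of as a category of their own.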
 
Max length 6
Mean length 3.816039265
Min length 1
Contains chars False
Contains digits True
Contains spaces False
Contains non-words True

sqft_living
Numeric

Distinct count 1034
Unique (%) 4.8%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 2080.32185
Minimum 370
Maximum 13540
Zeros (%) 0.0%

Quantile statistics

Minimum 370
5-th percentile 940
Q1 1430
Median 1910
Q3 2550
95-th percentile 3760
Maximum 13540
Range 13170
Interquartile range 1120

Descriptive statistics

Standard deviation 918.1061251
Coef of variation 0.4413288862
Kurtosis 5.252101951
Mean 2080.32185
MAD 698.0845425
Skewness 1.473215455
Sum 44928711
Variance 842918.8569
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 370. 510. 665. 695. 804.5 ... 4755. 5560. 6077.5 8015. 13540. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1300 138 0.6%
 
1400 135 0.6%
 
1440 133 0.6%
 
1660 129 0.6%
 
1010 129 0.6%
 
1800 129 0.6%
 
1820 128 0.6%
 
1480 125 0.6%
 
1720 125 0.6%
 
1540 124 0.6%
 
Other values (1024) 20302 94.0%
 

Minimum 5 values

Value Count Frequency (%)
370 1 < 0.1%
 
380 1 < 0.1%
 
390 1 < 0.1%
 
410 1 < 0.1%
 
420 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
13540 1 < 0.1%
 
12050 1 < 0.1%
 
10040 1 < 0.1%
 
9890 1 < 0.1%
 
9640 1 < 0.1%
 

sqft_living15
Numeric

Distinct count 777
Unique (%) 3.6%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1986.620318
Minimum 399
Maximum 6210
Zeros (%) 0.0%

Quantile statistics

Minimum 399
5-th percentile 1140
Q1 1490
Median 1840
Q3 2360
95-th percentile 3300
Maximum 6210
Range 5811
Interquartile range 870

Descriptive statistics

Standard deviation 685.2304719
Coef of variation 0.3449227141
Kurtosis 1.591732789
Mean 1986.620318
MAD 536.1565539
Skewness 1.106875397
Sum 42905039
Variance 469540.7996
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[ 399. 680. 829. 975. 994. ... 3755. 3995. 4325. 4945. 6210.], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
1540 197 0.9%
 
1440 195 0.9%
 
1560 192 0.9%
 
1500 180 0.8%
 
1460 169 0.8%
 
1580 167 0.8%
 
1610 166 0.8%
 
1800 166 0.8%
 
1720 166 0.8%
 
1620 164 0.8%
 
Other values (767) 19835 91.8%
 

Minimum 5 values

Value Count Frequency (%)
399 1 < 0.1%
 
460 2 < 0.1%
 
620 2 < 0.1%
 
670 1 < 0.1%
 
690 2 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
6210 1 < 0.1%
 
6110 1 < 0.1%
 
5790 6 < 0.1%
 
5610 1 < 0.1%
 
5600 1 < 0.1%
 

sqft_lot
Numeric

Distinct count 9776
Unique (%) 45.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 15099.40876
Minimum 520
Maximum 1651359
Zeros (%) 0.0%

Quantile statistics

Minimum 520
5-th percentile 1800.8
Q1 5040
Median 7618
Q3 10685
95-th percentile 43307.2
Maximum 1651359
Range 1650839
Interquartile range 5645

Descriptive statistics

Standard deviation 41412.63688
Coef of variation 2.742666122
Kurtosis 285.4958119
Mean 15099.40876
MAD 13825.41475
Skewness 13.07260357
Sum 326101931
Variance 1715006493
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[5.200000e+02 6.755000e+02 8.635000e+02 1.154500e+03 1.351500e+03 ... 2.178025e+05 2.246055e+05 2.942475e+05 5.061020e+05 1.651359e+06], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
5000 358 1.7%
 
6000 290 1.3%
 
4000 251 1.2%
 
7200 220 1.0%
 
7500 119 0.6%
 
4800 119 0.6%
 
4500 114 0.5%
 
8400 111 0.5%
 
9600 109 0.5%
 
3600 103 0.5%
 
Other values (9766) 19803 91.7%
 

Minimum 5 values

Value Count Frequency (%)
520 1 < 0.1%
 
572 1 < 0.1%
 
600 1 < 0.1%
 
609 1 < 0.1%
 
635 1 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
1651359 1 < 0.1%
 
1164794 1 < 0.1%
 
1074218 1 < 0.1%
 
1024068 1 < 0.1%
 
982998 1 < 0.1%
 

sqft_lot15
Numeric

Distinct count 8682
Unique (%) 40.2%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 12758.28351
Minimum 651
Maximum 871200
Zeros (%) 0.0%

Quantile statistics

Minimum 651
5-th percentile 2002.4
Q1 5100
Median 7620
Q3 10083
95-th percentile 37045.2
Maximum 871200
Range 870549
Interquartile range 4983

Descriptive statistics

Standard deviation 27274.44195
Coef of variation 2.137783027
Kurtosis 151.3956625
Mean 12758.28351
MAD 10102.39581
Skewness 9.524361965
Sum 275540649
Variance 743895183.7
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[6.510000e+02 9.145000e+02 1.056500e+03 1.168000e+03 1.279500e+03 ... 2.177945e+05 2.180110e+05 2.245520e+05 4.364705e+05 8.712000e+05], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
5000 427 2.0%
 
4000 356 1.6%
 
6000 288 1.3%
 
7200 210 1.0%
 
4800 145 0.7%
 
7500 142 0.7%
 
8400 116 0.5%
 
4500 111 0.5%
 
3600 111 0.5%
 
5100 109 0.5%
 
Other values (8672) 19582 90.7%
 

Minimum 5 values

Value Count Frequency (%)
651 1 < 0.1%
 
659 1 < 0.1%
 
660 1 < 0.1%
 
748 2 < 0.1%
 
750 4 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
871200 1 < 0.1%
 
858132 1 < 0.1%
 
560617 1 < 0.1%
 
438213 1 < 0.1%
 
434728 1 < 0.1%
 

view
Numeric

Distinct count 6
Unique (%) < 0.1%
Missing (%) 0.3%
Missing (n) 63
Infinite (%) 0.0%
Infinite (n) 0
Mean 0.2338627287
Minimum 0
Maximum 4
Zeros (%) 89.9%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 2
Maximum 4
Range 4
Interquartile range 0

Descriptive statistics

Standard deviation 0.7656862012
Coef of variation 3.274083927
Kurtosis 10.91971254
Mean 0.2338627287
MAD 0.4218521331
Skewness 3.399525635
Sum 5036
Variance 0.5862753587
Memory size 168.9 KiB

Histogram with fixed size bins (bins=6)

Value Count Frequency (%)
0 19422 89.9%
 
2 957 4.4%
 
3 508 2.4%
 
1 330 1.5%
 
4 317 1.5%
 
(Missing) 63 0.3%
 

Minimum 5 values

Value Count Frequency (%)
0 19422 89.9%
 
1 330 1.5%
 
2 957 4.4%
 
3 508 2.4%
 
4 317 1.5%
 

Maximum 5 values

Value Count Frequency (%)
4 317 1.5%
 
3 508 2.4%
 
2 957 4.4%
 
1 330 1.5%
 
0 19422 89.9%
 

waterfront
Boolean

Distinct count 3
Unique (%) < 0.1%
Missing (%) 11.0%
Missing (n) 2376

0 19075
1 146
(Missing) 2376

Value Count Frequency (%)
0 19075 88.3%
 
1 146 0.7%
 
(Missing) 2376 11.0%
 

yr_built
Numeric

Distinct count 116
Unique (%) 0.5%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 1970.999676
Minimum 1900
Maximum 2015
Zeros (%) 0.0%

Quantile statistics

Minimum 1900
5-th percentile 1915
Q1 1951
Median 1975
Q3 1997
95-th percentile 2011
Maximum 2015
Range 115
Interquartile range 46

Descriptive statistics

Standard deviation 29.37523413
Coef of variation 0.01490372347
Kurtosis -0.6576944258
Mean 1970.999676
MAD 24.56634737
Skewness -0.4694499765
Sum 42567680
Variance 862.9043803
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[1900. 1900.5 1904.5 1909.5 1910.5 ... 2009.5 2011.5 2013.5 2014.5 2015. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
2014 559 2.6%
 
2006 453 2.1%
 
2005 450 2.1%
 
2004 433 2.0%
 
2003 420 1.9%
 
2007 417 1.9%
 
1977 417 1.9%
 
1978 387 1.8%
 
1968 381 1.8%
 
2008 367 1.7%
 
Other values (106) 17313 80.2%
 

Minimum 5 values

Value Count Frequency (%)
1900 87 0.4%
 
1901 29 0.1%
 
1902 27 0.1%
 
1903 46 0.2%
 
1904 45 0.2%
 

Maximum 5 values

Value Count Frequency (%)
2015 38 0.2%
 
2014 559 2.6%
 
2013 201 0.9%
 
2012 170 0.8%
 
2011 130 0.6%
 

yr_renovated
Numeric

Distinct count 71
Unique (%) 0.3%
Missing (%) 17.8%
Missing (n) 3842
Infinite (%) 0.0%
Infinite (n) 0
Mean 83.63677837
Minimum 0
Maximum 2015
Zeros (%) 78.8%

Quantile statistics

Minimum 0
5-th percentile 0
Q1 0
Median 0
Q3 0
95-th percentile 0
Maximum 2015
Range 2015
Interquartile range 0

Descriptive statistics

Standard deviation 399.9464139
Coef of variation 4.781944279
Kurtosis 18.91954345
Mean 83.63677837
MAD 160.2641776
Skewness 4.573385242
Sum 1484971
Variance 159957.134
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)

Value Count Frequency (%)
0 17011 78.8%
 
2014 73 0.3%
 
2003 31 0.1%
 
2013 31 0.1%
 
2007 30 0.1%
 
2005 29 0.1%
 
2000 29 0.1%
 
1990 22 0.1%
 
2004 22 0.1%
 
2009 21 0.1%
 
Other values (60) 456 2.1%
 
(Missing) 3842 17.8%
 

Minimum 5 values

Value Count Frequency (%)
0 17011 78.8%
 
1934 1 < 0.1%
 
1940 2 < 0.1%
 
1944 1 < 0.1%
 
1945 3 < 0.1%
 

Maximum 5 values

Value Count Frequency (%)
2015 14 0.1%
 
2014 73 0.3%
 
2013 31 0.1%
 
2012 8 < 0.1%
 
2011 9 < 0.1%
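
The mean of 83.6 for yr_renovated is not a meaningful year: it averages real renovation years together with 17011 zeros. Reading 0 as "never renovated" is an assumption (the report only flags the zeros), but under that reading the statistics can be recomputed over renovated homes alone, using only the counts and the sum printed above.

```python
# Figures copied from the yr_renovated section and the overview.
n_observations = 21597
n_missing = 3842
n_zeros = 17011
reported_sum = 1484971

# Rows with any recorded value, and rows with a non-zero year.
n_present = n_observations - n_missing
n_renovated = n_present - n_zeros

# Mean as reported (zeros included) versus mean over non-zero years only.
# Assumption: 0 is a sentinel for "never renovated", so it adds nothing to the sum.
mean_with_zeros = reported_sum / n_present
mean_renovation_year = reported_sum / n_renovated

print("rows with a value:", n_present)
print("rows with a renovation year:", n_renovated)
print("mean including zeros:", round(mean_with_zeros, 2))
print("mean renovation year:", round(mean_renovation_year, 1))
```

The count of rows with a non-zero year also matches the frequency table above, where the non-zero entries (the nine listed years plus "Other values (60)") add up to the same number.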
 

zipcode
Numeric

Distinct count 70
Unique (%) 0.3%
Missing (%) 0.0%
Missing (n) 0
Infinite (%) 0.0%
Infinite (n) 0
Mean 98077.95185
Minimum 98001
Maximum 98199
Zeros (%) 0.0%

Quantile statistics

Minimum 98001
5-th percentile 98004
Q1 98033
Median 98065
Q3 98118
95-th percentile 98177
Maximum 98199
Range 198
Interquartile range 85

Descriptive statistics

Standard deviation 53.51307235
Coef of variation 0.0005456177596
Kurtosis -0.8540048606
Mean 98077.95185
MAD 46.730101
Skewness 0.4053221913
Sum 2118189526
Variance 2863.648913
Memory size 168.9 KiB

Histogram with fixed size bins (bins=50)
Histogram with variable size bins (bins=[98001. 98001.5 98002.5 98004.5 98005.5 ... 98151.5 98183. 98193. 98198.5 98199. ], "bayesian blocks" binning strategy used)

Value Count Frequency (%)
98103 602 2.8%
 
98038 589 2.7%
 
98115 583 2.7%
 
98052 574 2.7%
 
98117 553 2.6%
 
98042 547 2.5%
 
98034 545 2.5%
 
98118 507 2.3%
 
98023 499 2.3%
 
98006 498 2.3%
 
Other values (60) 16100 74.5%
 

Minimum 5 values

Value Count Frequency (%)
98001 361 1.7%
 
98002 199 0.9%
 
98003 280 1.3%
 
98004 317 1.5%
 
98005 168 0.8%
 

Maximum 5 values

Value Count Frequency (%)
98199 317 1.5%
 
98198 280 1.3%
 
98188 136 0.6%
 
98178 262 1.2%
 
98177 255 1.2%
 

Correlations

Missing values

Sample

First rows

index bathrooms bedrooms condition date floors grade id lat long price sqft_above sqft_basement sqft_living sqft_living15 sqft_lot sqft_lot15 view waterfront yr_built yr_renovated zipcode
0 1.00 3 3 10/13/2014 1.0 7 7129300520 47.5112 -122.257 221900.0 1180 0.0 1180 1340 5650 5650 0.0 NaN 1955 0.0 98178
1 2.25 3 3 12/9/2014 2.0 7 6414100192 47.7210 -122.319 538000.0 2170 400.0 2570 1690 7242 7639 0.0 0.0 1951 1991.0 98125
2 1.00 2 3 2/25/2015 1.0 6 5631500400 47.7379 -122.233 180000.0 770 0.0 770 2720 10000 8062 0.0 0.0 1933 NaN 98028
3 3.00 4 5 12/9/2014 1.0 7 2487200875 47.5208 -122.393 604000.0 1050 910.0 1960 1360 5000 5000 0.0 0.0 1965 0.0 98136
4 2.00 3 3 2/18/2015 1.0 8 1954400510 47.6168 -122.045 510000.0 1680 0.0 1680 1800 8080 7503 0.0 0.0 1987 0.0 98074
5 4.50 4 3 5/12/2014 1.0 11 7237550310 47.6561 -122.005 1230000.0 3890 1530.0 5420 4760 101930 101930 0.0 0.0 2001 0.0 98053
6 2.25 3 3 6/27/2014 2.0 7 1321400060 47.3097 -122.327 257500.0 1715 ? 1715 2238 6819 6819 0.0 0.0 1995 0.0 98003
7 1.50 3 3 1/15/2015 1.0 7 2008000270 47.4095 -122.315 291850.0 1060 0.0 1060 1650 9711 9711 NaN 0.0 1963 0.0 98198
8 1.00 3 3 4/15/2015 1.0 7 2414600126 47.5123 -122.337 229500.0 1050 730.0 1780 1780 7470 8113 0.0 0.0 1960 0.0 98146
9 2.50 3 3 3/12/2015 2.0 7 3793500160 47.3684 -122.031 323000.0 1890 0.0 1890 2390 6560 7570 0.0 0.0 2003 0.0 98038

Last rows

index bathrooms bedrooms condition date floors grade id lat long price sqft_above sqft_basement sqft_living sqft_living15 sqft_lot sqft_lot15 view waterfront yr_built yr_renovated zipcode
21587 2.50 3 3 8/25/2014 2.0 8 7852140040 47.5389 -121.881 507250.0 2270 0.0 2270 2270 5536 5731 0.0 NaN 2003 0.0 98065
21588 2.00 3 3 1/26/2015 3.0 8 9834201367 47.5699 -122.288 429000.0 1490 0.0 1490 1400 1126 1230 0.0 0.0 2014 0.0 98144
21589 2.50 4 3 10/14/2014 2.0 9 3448900210 47.5137 -122.167 610685.0 2520 0.0 2520 2520 6023 6023 NaN 0.0 2014 0.0 98056
21590 3.50 4 3 3/26/2015 2.0 9 7936000429 47.5537 -122.398 1010000.0 2600 910.0 3510 2050 7200 6200 0.0 0.0 2009 0.0 98136
21591 2.50 3 3 2/19/2015 2.0 8 2997800021 47.5773 -122.409 475000.0 1180 130.0 1310 1330 1294 1265 0.0 0.0 2008 0.0 98116
21592 2.50 3 3 5/21/2014 3.0 8 263000018 47.6993 -122.346 360000.0 1530 0.0 1530 1530 1131 1509 0.0 0.0 2009 0.0 98103
21593 2.50 4 3 2/23/2015 2.0 8 6600060120 47.5107 -122.362 400000.0 2310 0.0 2310 1830 5813 7200 0.0 0.0 2014 0.0 98146
21594 0.75 2 3 6/23/2014 2.0 7 1523300141 47.5944 -122.299 402101.0 1020 0.0 1020 1020 1350 2007 0.0 0.0 2009 0.0 98144
21595 2.50 3 3 1/16/2015 2.0 8 291310100 47.5345 -122.069 400000.0 1600 0.0 1600 1410 2388 1287 0.0 NaN 2004 0.0 98027
21596 0.75 2 3 10/15/2014 2.0 7 1523300157 47.5941 -122.299 325000.0 1020 0.0 1020 1020 1076 1357 0.0 0.0 2008 0.0 98144